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(57) Abstract: The present invention is related to a new detection and/or quantification method of a nucleotide sequence from other 
homologous sequences, and being present in a biological sample, comprising the steps of: amplifying or copying at least a part of 
original nucleotide sequences present in the biological sample into target nucleotide sequences; putting into contact the obtained 
target nucleotide sequences with corresponding capture nucleotide sequences bounded to an insoluble solid support, characterised in 
that said capture nucleotide sequences have a (single stranded) length comprised between about 40 and about 400 bases, and in that 
said capture nucleotide sequences are bounded to the insoluble support according to an array with a density of at least 5 different 
bound capture nucleotide sequences / cm 2 surface of solid support; and detecting and/or quantifying a signal resulting from the 
fonnation of double stranded nucleotide sequences resulting from the hybridisation by complementary base pairing. 
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METHOD AND KIT FOR THE SCREENING AND/OR THE QUANTIFICATION 
OF MULTIPLE HOMOLOGOUS NUCLEIC ACID SEQUENCES ON ARRAYS 

10 Field of the invention 

[0001] The present invention is related to a method 

and kit comprising reagents for simultaneous detection 
and/or quantification of multiple homologous nucleic acid 
sequences on arrays . 

15 

Background of the invention and state of the art 
[0002] The development of miniaturisation, 

especially the array technology, allows the simultaneous 
detection of a great number of nucleotide sequences present 

20 in biological samples. The array contains on its surface, 
series of discrete regions bearing capture nucleotide 
nucleotide sequences able to bind by hybridisation 
corresponding target nucleotide sequences. If the last ones 
are labelled, a signal can be detected and measured at the 

25 binding location and its intensity gives an estimation of 
the amount of target sequences present in the sample. 
[0003] It is known that the presence of a particular 

oligonucleotide sequence at a specific location on a 
support can be obtained by using masks at each step 

3 0 comprising the addition of a new nucleotide on a growing 
oligonucleotide in order to obtain the desired sequence. 
This mask technique is derived from the photolithographic 
technology and is coupled with the use of photoprotective 
groups, which are released before a new nucleotide is 
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added. The obtained array can contain several thousands of 
discrete domains, each one bearing a specific 
oligonucleotide as capture nucleotide sequence for the 
possible detection of thousands of target sequences. Only 
5 small oligonucleotides are present on the surface, and are 
used for sequencing or identifying sequences in a pattern 
of positive spots corresponding to specific 

oligonucleotides present on the array. The identification 
of the target sequence is made by comparison of said 

10 pattern with a reference. The documents W097/29212 and 
W098/28444 describe a similar technique with the use of 
capture nucleotide sequences comprising less than 30 
nucleotides allowing an analysis of two different sequences 
that may differ by a single nucleotide. Small capture 

15 nucleotide sequences (between 10 and 2 0 nucleotides) are 
preferred since the discrimination between two 
oligonucleotides differing in one base is higher, when 
their length is smaller. 

[0004] However, for longer target fragments (like 

20 the cDNA copy of mRNA) , several fragments can recognise the 
same nucleotide sequence and the rate of hybridisation is 
lower. Therefore, the fragments are cut into smaller 
species and the method should require the use of numerous 
capture nucleotide sequences in order to obtain a pattern 
25 of signals which attest the presence of a given gene 
(WO97/10364 and W097/27317) . In this case, short capture 
sequences are used, preferably about 4 5 nucleotides, and in 
practice, between 10 and 25 nucleotides. 

[0005] The document US-5,605,662 describes the 

30 binding of full nucleotide sequences on plots of electronic 
based chips, each of the plots bearing a specific 
oligonucleotide sequence. A target sequence is spotted on 
the chips and specific small labelled nucleotides are then 
incubated in order to determine their binding to the 
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target. A change in the plot charge allows a discrimination 
of the different nucleotide differing in one base, which 
allows a specific SNP analxsis (Gilles et al . 1999, Nature 
Biotechnology 17, p. 365 (1999)). 
5 [0006] The document W094/22889 describes electronic 

chips made of preconstructed oligonucleotides bearing a 
pyrol group used in an electropolymerisation step and 
allows the formation of a conducting polymer. This method 
is particularly suitable for the preparation of electronic 

10 chips bearing small capture nucleotide sequences. 

[0007] The approaches using either small or long 

capture nucleotide sequences fixed on array, are not 
optimal conditions when applied for diagnostic purpose, 
. especially, when different sequences having a high homology 

15 between them should be detected and possibly quantified at 
the same time upon the same microarray. 

Aims of the invention 

[0 008] The present invention aims to provide a new 

2 0 method to improve the detection and possibly the 
quantification of nucleotide sequences, preferably multiple 
homologous sequences, coming from different or the same 
organism, by hybridisation on single stranded capture 
nucleotide sequences, and which does not present the 
2 5 drawbacks of the state of the art. 

Summary of the invention 

[0009] The inventors have discovered that it is 

possible to provide a sensitive detection method (possibly 
30 combined with a quantification) of multiple nucleotide 
sequences upon an array, even if said multiple sequences 
are homologous and are present simultaneously in the sample 
submitted to the analysis. Until now, either the 
sensitivity was low or null or there was cross-reactivity 
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between the different (but homologous) target sequences on 
the same capture nucleotide sequences. 

[0010] One unexpected observation was the result 

obtained when a given target sequence was incubated with an 
5 array bearing corresponding capture nucleotide sequences of 
various length (example 2 and figure 1) . With a target 
sequence made of a double stranded amplicon (just denatured 
before the experiment) , either the use of too long or too 
small capture nucleotide sequences gave a lower signal (and 

10 sometimes a much lower signal) compared to the one obtained 
when capture nucleotide sequences have the same length as 
the target sequence. The highest sensitivity was obtained 
with capture nucleotide sequences with moderate length and 
a size similar to the target amplicon. For very long target 

15 amplicons, the situation is much more complicated 
(reassociation of the amplicons, secondary structures, . ..) 
and depends on the nature of the sequence. In sandwich 
hybridisation of long target amplicons upon microplates, 
optimal capture nucleotide sequences have been found to be 

20 between 50 and 500 bases. (W098/11253) . 

[0011] The present invention is related to an 

(improved in sensitivity and specificity compared to known 
techniques) detection and/or quantification method of a 
nucleotide sequence from at least 4 others homologous 

25 sequences, possibly present in a biological sample, by the 
identification of a portion of its sequence, said method 
comprising the steps of: 

- possibly extracting original nucleotide sequences from 
the biological sample, 

30 - amplifying or copying at least a part (or portion) of 
the original nucleotide sequences into target nucleotide 
sequences, each possible target nucleotide sequence 
being located at a different location of the sequence. 



WO 01/31055 



PCT/BEOO/00123 



- possibly denaturing the double stranded target 
nucleotide sequences into single target nucleotide 
sequences , 

- possibly cleaving said sequences into smaller target 
5 nucleotide sequences having a length higher than about 

40 and lower than about 400 base pairs (or at a 
preferred size described hereafter) by various methods 
(restriction enzymes, addition of NaOH, etc.), 

- possibly labelling said target nucleotide sequences, 

10 - putting into contact the obtained target nucleotide 
sequences with corresponding capture nucleotide 
sequences bounded (linked or fixed) to an insoluble 
solid support, said capture nucleotide sequences having 
a (single stranded) length comprised between about 40 

15 and about 400 bases, preferably, between about 50. and 

about 3 50 bases, more preferably between about 10 0 and 
about 3 00 bases, even more preferably between about 12 0 
and about 200 bases, said capture nucleotide sequences 
being bound to the insoluble solid support according to 

20 an array with a density of at least 5 different bound 

capture nucleotide sequences/cm 2 surface of solid 
support . 

[0012] The method comprises also the step of 

detecting and/or possibly quantifying a signal (preferably 

25 from the label of the target nucleotide sequences) 
resulting from the formation of double stranded nucleotide 
sequences resulting from the hybridisation (of the capture 
and target nucleotide sequences) by base pairing. 
[0013] The labelling (with known techniques from the 

30 person skilled in the art) is preferably also obtained upon 
the amplified sequence previously to the denaturation (if 
the method comprises an amplification step) . 
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[0014] The method according to the invention may 

also further comprise the step of correlating the signal of 
detection to the identification (location) of the specific 
target nucleotide sequence (one spot = 1 specific sequence) 
5 without comparing the hybridisation pattern obtained with a 
standard pattern as described in the state of the art 
(WO97/02102) , to the presence of specific microorganisms, 
or genetic characteristics (polymorphism, genetic diseases, 
diagnostic and monitoring of cancer, etc.) for a patient 

10 (from which the biological sample has been obtained) . The 
biological sample can be any culture medium wherein 
microorganisms or pollutants are present, or an extract 
obtained from a plant or an animal (including a human) 
organ, tissue, cell or biological fluid. 

15 [0015] The various steps of the method according to 

the invention can be done with various means well known by 
the person skilled in the art and described in the 
literature. 

[0016] The method according to the invention can be 

2 0 performed by using a specific diagnostic and/or 

quantification kit comprising at least an insoluble solid 
support upon which capture nucleotide sequences are 
disposed (preferably bounded to the solid support by a 
covalent link) according to an array with a density of at 
25 least 5 different bound capture nucleotide sequences / cm 2 
surface of the insoluble solid support, said capture 
nucleotide sequences having a length comprised between 
about 4 0 and about 400 bases or preferably a length as 
above-described. 

3 0 [0017] In the method and kit according to the 

invention, the density of the capture nucleotide sequences 
upon the array of the solid support can be increased, for 
instance by having more than 10, 20, 50, 100, 1000, 10000 
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or more capture nucleotide sequences/cm 2 surface of solid 
support. In the preferred embodiment, each capture 
nucleotide sequence is located in a specific area of the 
support. However, several capture nucleotide sequences can 
5 also be present in the same area to obtain specific 
information. 

[0018] The kit according to the invention may also 

incorporate various media or devices for performing 
(preferably automatically) the method according to the 
. 10 invention. Said kit can be included in an automatic device, 
such as a high throughput screening apparatus for the 
detection and/or the quantification of multiple nucleotide 
sequences present in a biological sample to be analysed. 
. Said kit or apparatus can be adapted for performing all the 
15 steps or only several specific steps of the method 
according to the invention. 

[0 OlS] In the method, kit or apparatus according to 

the invention, the length of the single stranded capture 
nucleotide sequences is preferably identical to the length 

2 0 of the target nucleotide sequences to be detected and/or 
quantified or may differ, preferably by less than 50% in 
total length, more preferably less than 30% in total 
length, even more preferably less than 10% in total length. 
[0020] The method, kit or apparatus according to the 

25 invention are suitable for the detection and/or the 
quantification of target nucleotide sequences which are 
made of DNA or RNA, including sequences which are partially 
or totally homologous upon their total length. 
[0021] The method according to the invention can be 

30 performed even when the different original nucleotide 
sequences present between them an homology greater than 
30%, greater than 60% or even greater than 80% or differ by 
few bases . 
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[0022] In the method, kit or apparatus according to 

the invention, the capture nucleotide sequences are 
advantageously covalently bounded (or fixed) upon the 
insoluble solid support , preferably by one of their 
5 extremities as described hereafter. 

[0023] With the method according to the invention, 

the yield of hybridisation is advantageously greater than 
10%, 50%, 70%, 80% or 90% or can achieve almost 100%. 
[0024] The method, kit and apparatus according to 

10 the invention may comprise the use of other bounded capture 
nucleotide sequences (i.e. by allowing an hybridisation 
with a standard nucleotide sequence used for the 
quantification, with a consensus sequence of different 
micro-organisms strains or with a sequence allowing a 

15 detection of a possible an antibiotic resistance by micro- 
organisms) or other non- homologous sequences, said other 
capture nucleotide sequences having possibly a length 
higher than 400 bases and being also bounded upon the 
insoluble solid support (biochip) , preferably in the array 

20 made with the other bounded capture nucleotide sequences. 

[0025] The solid support according to the invention 

can be or can be made with materials selected from the 
group consisting of glasses, electronic devices, silicium 
or plastic support, compact discs, filters, metallic 

25 supports or a mixture thereof. Advantageously, said solid 
support is . a single glass plate which may comprise 
additional means (barcodes, markers, etc.) or media 
(coating, etc.) for improving the method according to the 
invention. 

30 [0026] The amplification step(s) used in the method 

according to the invention is advantageously obtained by 
well known amplification protocols, preferably selected 
from the group consisting of PCR, LCR, CPR, NASBA, ICR or 
Avalanche DNA techniques. 
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[0027] Advantageously, the length of the target 

nucleotide sequences to be detected is determined by the 
conditions of the above- identified amplification protocols 
determined by the use of specific primers (and possibly 
5 blocking nucleotide sequences) for a retro- transcription of 
the 3' or 5* end of the original biological nucleotide 
sequences to be detected and/or quantified, especially if 
it is a RNA sequence. Advantageously, said RNA sequences 
are 16S and 23S rRNA sequences or 18S and 28S rRNA 
10 sequences. 

[0028] In a preferred embodiment of the present 

invention, the target nucleotide sequences to be copied or 
amplified are obtained from different parts or portions of 
the corresponding (homologous) DNA or RNA original 
15 biological nucleotide sequence. 

[0029] In a preferred embodiment of the invention, 

the primers used for the amplification preferably bear at 
their 3' end a base specific of one of the homologous 
sequences . 

20 [0030] According to a further aspect of the present 

invention, the method, kit or apparatus according to the 
invention is advantageously used for the detection and/or 
the quantification of different Staphylococci species or 
variants, preferably the Staphylococcus aureus, the 

2 5 Staphylococcus epidermidis, the Staphylococcus 

saprophyticus, the Staphylococcus hominis or the 
Staphylococcus haemolyticus present together or separately 
in a biological sample, said detection being obtained by 
detecting the genetic variants of the FemA gene in said 

3 0 different species, preferably by using specific locations 

in the FemA genetic sequence, as described in the document 
WO99/16780 incorporated herein by reference. 
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[0031] Preferably, the primers and the specific 

portions of said FemA sequence used for obtaining amplified 
products are the ones described hereafter in the examples. 
[0032] The method according to the invention may 

5 also comprise means for obtaining a quantification of 
target nucleotide sequences by using a standard nucleotide 
sequence (external or internal standard) which can be 
brought into contact with the capture nucleotide sequences 
bounded upon the array of the solid support in the same 

10 conditions as said target nucleotide sequences (possibly 
after amplification or copying) . Said method comprises a 
step of quantification of a signal resulting from the 
formation of a double stranded nucleotide sequence formed 
by complementary base pairing between the capture 

15 nucleotide sequences and the standard nucleotide sequences 
and a step of a correlation analysis between the signal 
resulting from the formation of said double stranded 
nucleotide sequence and the signal resulting from the 
double stranded nucleotide sequence formed by complementary 

2 0 base pairing between capture nucleotide . sequences and 

target nucleotide sequences in order to quantify the 
presence of the original nucleotide sequence to be detected 
and/or quantified in the biological sample (see also the 
document W098/11253 incorporated herein by reference) . Said 
25 standard nucleotide sequence (external and/or internal 
standard) is advantageously included in the kit or 
apparatus according to the invention, possibly with the 
media and device (s) necessary for performing the different 
steps according to the invention, such as the hybridisation 

3 0 and culture media, polymerases, enzymes, standard sequences 

and labelling molecules. 

[0033] The present invention will be described in 

details in the following non-limiting examples in reference 
to the enclosed figures. 



1 
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Brief description of the drawings 

[0034] Fig. 1 shows the influence of the capture 

nucleotide sequence length on the yield of hybridisation of 
target amplicons. The target sequences were 155 bases long 
5 and 100 fmoles were incubated on a chips containing single 
stranded capture nucleotide sequences going from 23 to 437 
bases long. The length of the capture nucleotide sequence 
is shown on the figure. 

[0035] Fig. 2 schematically represents the FemA 

10 detection of 5 different species of Staphylococci. The 
locations of the 5 pairs of primers used for specific 
amplification of one of the 5 different sequences belonging 
to the 5 Staphylococci species are shown on the FemA 
sequence. A consensus sequence is also amplified by using 2 
15 primers common to all Staphylococci species . The labelled 
amplified sequences are then hybridised on the chips. 
[0036] Fig. 3 schematically represents the detection 

of rRNA by the copy of small portions of the sequence 
either at its 5' end using one starting nucleotide sequence 
20 or along its sequence using both a starting nucleotide 
sequence and a blocking nucleotide sequence. The small 
copied sequences are then hybridised on the array of the 
chips . 

25 Definitions 

[0037] The term "nucleoside triphosphate" refers to 

nucleosides present in either DNA or RNA and thus includes 
nucleosides which incorporate adenine, cytosine, guanine, 
thymine and uracil or other modified bases (i.e. 

30 8-azaguanine and hypoxanthine) as bases, the sugar moieties 
being deoxyribose or ribose. 

[0038] The term "nucleotide" as used herein refers 

to nucleosides present in nucleic acids (either DNA or RNA) 
compared with the bases of said nucleic acid, and includes 
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nucleotides comprising usual or modified bases as above 
described. References to nucleotide (s) , oligonucleotide (s) 
and the like include analogous species wherein the sugar- 
phosphate backbone is modified and/or replaced, provided 
5 that its hybridisation properties are not destroyed (the 
backbone may be replaced by an equivalent synthetic 
peptide, Peptide Nucleic Acid (PNA) ) . 

[003 9] The primer sequence need not reflect the 

exact sequence of the template to be amplified or copied 

10 provided that under hybridising conditions the primers can 
be used in a genetic amplification. Mismatched bases can be 
introduced into the primer sequence to provide altered 
hybridisation introduced into the primer sequence to 
provide altered hybridisation stringency (amplification or 

15 copying of the same regions of homologous sequences by the 
same primers) . 

[0040] The terms "capture or target nucleotide 

sequences, target nucleic acid, hybridising specifically 
to, background, quantifying" are defined as in WO97/10365, 
20 incorporated herein by reference. 

[0041] "Homologous sequences" mean nucleotide 

sequences having the same nucleotide at corresponding 
positions. They are generally defined as a minimum of 
homology (or sequence identity) between sequences wherein 

2 5 . the percentage of identical nucleotides after the sequences 

has been optimally aligned taking into account additions or 
deletions like gaps (for sequences of a given gene, present 
in genetically different sources like different organisms 
or for proteins or enzymes of the same family) . The degree 

3 0 of homology (or sequence identity) can vary a lot as 

homologous sequences may be only in one or several portions 
or all along the complete sequences. The parts or portions 
of the sequences that are identical in both sequences are 
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said conserved. The sequences showing a high degree of 
invariance or homology are said to be highly conserved. 
[0042] Methods of alignment of sequences are based 

on local homology algorithms which have been computerised 
5 and are available as for example (but not limited to) 
Clustal®, (Intelligenetics, Mountain Views, California), or 
GAP®, BESTFIT®, FASTA® and TFASTA® (Wisconsin Genetics 
Software Package, Genetics Computer Group Madison, 
Wisconsin, USA) or Boxshade® . 

10 

Detailed description of the invention 

[0043] The amplification of target sequences of 100 

to 200 bases and hybridisation on capture nucleotide 
sequences comprised between 100 and 200 bases allows to 

15 solve many problems posed by the differential diagnostic of 
homologous sequences. However smaller lengths (especially 
for capture nucleotide sequences) going down to 4 0 bases 
and longer length going up to 400 bases can also be used 
(mostly limited by the amplification step which needs the 

20 use of two primers and some sequences to be copied) . The 
reduction of the capture nucleotide sequence for a given 
target sequence leads to a decrease in sensitivity, but an 
increase in specificity. Longer target sequences limit the 
possibility to design the various targets amplicons on the 

25 same sequence as shown in Fig . 2 . It also reduces 
specificity and can lead to secondary structures decreasing 
the yield of hybridisation. 

[0044] Target DNA sequences are amplified by 

classical methods like PCR using primers so that small 
3 0 fragments are produced which are then used for 
hybridisation on an array bearing the corresponding capture 
nucleotide sequences. RNA can be copied and retro- 
transcripted in cDNA and amplified the same way if 
necessary. 
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[0045] When homology between the sequences to be 

detected in the sample is not too high (typically between 
30 and 60%) , the homologous sequences are amplified or 
copied (for instance by using the same primers) and the 
5 discrimination is obtained on the array using moderate 
length capture nucleotide sequences as proposed in the 
present invention. The use of the same primers is possible 
given that some parts of the sequences are similar or even 
identical on a certain distance and it facilitates the 

10 amplification step. However since the sequences are also 
rather different makes possible to capture them 
specifically on moderate length capture nucleotide 
sequence. The fact that both target and capture nucleotide 
sequences are of similar length makes the detection 

15 sensitive. 

[0046] However when several sequences with high 

homology (for instance higher than 60%) have to be analysed 
the discrimination by hybridisation on capture nucleotide 
sequences only is not always sufficient (cross-reactions 

20 between two homologous sequences on the same capture 
nucleotide sequence) . Therefore, different small fragments 
of the target sequences are either amplified or copied. 
These fragments are located in different part of the 
sequence for the different targets. Some parts of these 

25 locations can be overlapping so that the locations are not 
totally but only partially distinct. This situation is 
compatible with the invention as long as there are no cross 
reactions on the capture nucleotide sequences by the 
different target nucleotide sequences. These small 

30 fragments are then detected on their specific capture 
nucleotide sequences of moderate length. High sensitivity 
is obtained throughout the use of capture nucleotide 
sequences of moderate length similar to the sequences to 
detect. The specificity is obtained first during the 
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amplification using specific primers for each sequence to 
be amplified or copied and secondly on the microarray. For 
a sample analysis, the different primers are used together 
in a multiplex PCR amplification before analysis on the 

- 5 biochips. Preferably, one may use primers specific for each 
target having also at their 3' end a base specific of the 
sequence to detect so that (in appropriate conditions) only 
the target sequence bearing the corresponding base will be 
copied and amplified by the polymerase. 

10 [0047] In an alternative, a semi -consensus multiplex 

PCR was tested by using a primer common for all sequences 
and a second primer specific of each homologous sequence. 
Specific capture nucleotide sequences of moderated length 
were selected for each sequence in a different part of the 

15 overall target sequence. 

[0048] In another alternative, which can be combined 

with the previous mentioned ones, multiple capture 
nucleotide sequences which recognise different parts of a 
t same target are fixed to the array, which contain two or 

20 three capture nucleotide sequences binding to different 
parts of a given amplified sequence. The resulting signals 
for each of these spots are in a given ratio for a given 
target sequence, while it is different for another sequence 
which partly cross -reacts as the homology between two 

25 sequences is not homogeneous (with some parts being very 
different while others are more conserved) . 

[0049] The assay for RNA present in multiple copies 

in cells and in large amount like the 16s or 23s rRNA in 
procaryote cells or the 18s or 28s rRNA in eucaryote cells 
30 does not require necessarily amplification. The detection 
based on their hybridisation on microarray is particularly 
well suited since they show a high degree of homology 
between different species. It is possible to copy one 
specific part of the RNA by using a primer to start the 
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copy and a blocking nucleotide sequence which binds to the 
RNA and stops the copying by the reverse transcriptase 
(example 5). This blocking oligonucleotide has its 3 1 end 
blocked so that it can not be used by the transcriptase to 
5 start a copy. The most simple 3' block being a deoxy 3' 
carbon, but others like the presence of a pyrophosphate at 
the 3' carbon or a 2 1 , 3 ' -dideoxycarbon are also working. 
The 5' end of this blocking oligonucleotides is also 
modified like for example by a NH2 group in order to block 

10 transcriptase. The primer can also bear at its 3' end a 
base specific of a given sequence, in order to make a more 
specific copy. Therefore, different specific parts of the 
rRNA of different organisms can be copied making their 
detection on the array unequivocal since not only their 

15 sequence, but also their location will.be specific. 

[0050] The same copy of the RNA can be obtained with 

messenger RNA (mRNA) which can be transcripted in cDNA and 
amplified by PCR if necessary. 

[0051] The development of chemistry allows the 

20 covalent fixation to glass of nucleotides bearing a 
specific functional group (Lamture et al., Nucleic Acid 
Res. 22, pp. 2121-2125 (1994)). By using aminoterminal 
groups present on the oligonucleotides, it is possible to 
obtain a covalent binding of nucleotides upon aldehyde 
2 5 groups present on the array. The capture nucleotide 
sequences are either chemically synthesised or produced by 
PCR (amplicons) and bounded to a f unctionalised glass by a 
robot and thereafter rendered single stranded. The capture 
nucleotide sequences have complementary sequences related 
30 to the target DNA to detect and have a similar, if not 
identical length. Difference of 50 % in the length still 
gives a high sensitivity binding. The array is constructed 
with an appropriated automate which deposits the capture 
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nucleotide sequences at a given location which delimitates 
a spot. Triplicate of each capture nucleotide sequence plus 
the negative and positive controls (and possible standard 
sequences) are used. The array contains typically between 
5 20 and 100 spots, but arrays going to 400 spots, 1000 or 
10000 spots, or more are possible. An array with 400 spots 
per cm 2 is obtained with pins of 0.2 mm at low cost. As the 
capture nucleotide sequences are present in a sufficient 
number, it allows the lecture of these spots with a great 

10 resolution with known detection apparatus. 

[0052] Other supports like filters either of 

nitrocellulose or nylon are also proposed for arrays. The 
binding sites of the DNA on these filters is however at 
random along the sequence and it is impossible to predict 

15 the size of the available single stranded sequence which is 
then available for hybridisation. 

[0053] Binding of oligonucleotides sequences can be 

obtained through covalent or non- covalent binding on these 
supports in a direct or indirect reaction. One common 

20 method is to spot the capture nucleotide sequences on a 
, surface where polylysine has been attached on glass or 
plastic through the binding of biotinylated 
oligonucleotides on streptavidin coated surfaces or through 
the use of proteins with binding affinity for 

25 oligonucleotides (EP-A-0491059) . 

[0054] Gel layers (US-A-5 , 552 , 270 and EP-0535242) , 

copolymerisation of acrylamide with vinyl bearing 
oligonucleotides (US- A- 5, 73 6 , 2 57) or other supports, which 
can be activated for covalent fixation of oligonucleotides, 

30 can be used for arrays. Plastic like polycarbonate as 
present on CD was activated in order to fix capture 
nucleotide sequences and be used as a bio-CD array 
(W099/35499) . 
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[0055] Hybridisation between two DNA or RNA chains 

is a complex process which is initiated by the binding of a 
few (4-5) nucleotides which recognised themselves in a 
specific way and once they are bound a very fast process 
5 thermodynamically favourable elongation of the binding 
occurs along the sequence. The binding is thus dependent 
both of kinetic and thermodynamic parameters and 
experimental conditions can be adapted in order to modify 
both of them. Temperature accelerates the kinetic process 

10 but there is an optimum for the temperature used to obtain 
a maximum binding of the target. The salt concentration 
modifies the stringency conditions: more salts present in 
the solution, more easy will be the binding of the two 
chains. Preferred conditions are obtained with targets and 

15 capture nucleotide sequences of the same size and of 
moderate length. 

[0056] A very good yield of capture of the target on 

the capture nucleotide sequences can be obtained even if 
the targets are double stranded like after PCR 

2 0 amplification. The hybridisation of the target DNA is a 

competitive reaction: the target strand can hybridise on 
the fixed capture nucleotide sequence but can reassociate 
. in solution with its complementary strand. The reaction in 
solution is always kinetically favourable due to the free 
25 movement of molecules in solution. The rate of reaction is 
proportional to the square root of the length of the 
shorter strand. When capture nucleotide sequences have 
almost similar length as the target, the two reactions are 
independent of the length, since they are the same or 

3 0 similar. An optimum number of the same capture nucleotide 

sequences has been fixed per spot so that the high number 
of capture nucleotide sequences compensates for the loss 
obtained by the diffusion constrains. Moderated length 
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capture nucleotide sequences disposed at a certain distance 
from the surface lower the diffusion effect. 

[0057] Other components of reactive solutions have 

also to be incorporated like buffer (s) , detergent (s) , DMSO 
5 or the addition of non specific DNA like salmon DNA. 

[0058] After hybridisation on the array, the target 

sequences can be detected by current techniques. Without 
labelling, preferred methods are the identification of the 
target by mass spectrometry now adapted to the arrays 
10 (US-A-5, 821, 060) or by intercalating agents followed by 
fluorescent detection (W097/27329 or Fodor et al . , Nature 
364, p. 555 (1993) ) . 

[0059] The labelled associated detections are 

numerous (see review in W0 97/27317) . They are obtained 

15 using either already labelled primer or by incorporation of 
labelled nucleotides during the copying or amplification 
step. A labelling can also be obtained by ligating a 
detectable moiety onto the RNA or DNA to be tested (a 
labelled oligonucleotide ligated at the end of the sequence 

20 by a ligase) . Fragment of RNA or DNA can also be 
incorporate labelled nucleotides at their 5 'OH or 3 1 OH ends 
using a kinase, a transferase or a similar enzyme. 
[0060] Labels like fluorescent nucleotide sequences 

like Cy3, Cy5 and Cy7 are suitable for analysing an array 

25 by using commercially available array scanners (General 
Scanning of Genetic Microsystem) . Radioactive labelling, 
cold labelling or labelling with small molecules recognised 
thereafter by specific ligands (streptavidin or antibodies) 
are common methods. The resulting signal of target fixation 

30 on the array is either fluorescent, colorimetric, 
diffusive, electroluminescent, bio- or chemiluminescent , 
magnetic, electric like impedometric or voltametric 
(US-A-5, 312, 527) . The two preferred embodiments of the 
invention are the fluorescent detection or the gold 
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labelling of the bound target in order to obtain a 
precipitate or silver staining which is then easily- 
detected and quantified by a scanner (EP-99870106 . 4) . 
[0061] The signal obtained for each spot is recorded 

5 and the mean of the signal is calculated for identical 
capture nucleotide sequences. In practice at least two and 
preferably three to five identical spots are present on 
each array in order to correct for variation which can 
occur at any step of the process. The background value is 

10 identified either in the part of the array which has no 
capture nucleotide sequence or on spots bearing non 
specific capture nucleotide sequence (negative control) . A 
positive control is preferably added (a DNA sequence which 
is added to the hybridisation solution and in which capture 

15 nucleotide sequence is present at least on one spot of the 
array) , to test for .hybridisation step, solutions and 
conditions used and detection. Different positive 
nucleotide sequences present at various concentrations can 
also be added to the sample in order to obtain a reference 

20 curve for the signal. The various signals of the spots can 
then be compared to this reference curve. 

[0062] Quantification takes into account the 

hybridisation yield and detection scale on the array (which 
is identical for target and reference sequences) and the 

2 5 extraction, the amplification (or copying) and the 
labelling steps. Internal standard are used in 
quantification by the measurement of the target sequence 
compared to a given sequence (reference) and to which the 
other values will be compared. 

30 [0063] External standard can also be added to the 

sample for the quantification. If PCR is used, an internal 
standard contains at its extremities the same sequences as 
the target in order to be amplified by the same primers. It 
can also be of the same length, has the same GC content or 
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even have a large part of its sequence identical to the 
target in order to be really competitive during the 
amplification step. 

5 Examples 

Example 1: Detection of target nucleotide sequences on an 
array 

[0064] The protocol described by Schena et al (Proc. 

Natl Acad. Sci . USA 93, 10614 (1996)) was followed for the 

10 grafting of aminated DNA to aldehyde derivatised. The long 
aminated capture nucleotide sequences (100-400 bases) were 
spotted at a concentration of 150 nM while the small 
oligonucleotides were at 450 nM. The capture nucleotide 
sequences were printed onto the silylated microscopic 

15 slides with a home made robotic device, using 250 /xm pins 
from Genet ix (UK) and silylated (aldehyde) microscope 
slides from Cell associates (Houston, USA) . The spots have 
400 /xm in diameter and the volume dispensed is about 1 nl . 

Slides were dried at room temperature and stored at 4 °C 
2 0 until used. 

[0065] 5 /xl of hybridisation solution were loaded on 

glass slides bearing the capture nucleotide sequences. This 
mixture contained : SSC2X, SDS 4%, salmon sperm DNA 100 
iig/ml, 2 nM biotinylated CMV amplicons of 437 bp and 10 nM 

25 of biotinylated target amplicons. Microarrays were covered 
with coyerslips prewashed with ethanol 100%. Slides were 
denatured at 95 °C for 5 min. The hybridisation was carried 
out at 65° for 2 h. Samples were washed 4 times with Maleic 
buffer 10 mM pH 7.5, NaCl 15 mM, Tween 0.1%. 

30 [0066] The glass samples were incubated 45 min at 

room temperature with 800 /il of streptavidin labeled with 
colloidal gold. After washing, the presence of gold served 
for catalysis of silver reduction using the staining 
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revelation solution (Sigma St Louis, Mi) . The slides were 
dried before being store at room temperature and analysed 
using a micro array reader. 

5 Example 2: Comparison of the sensitivity obtained for 
hybridisation of a target sequence of medium size (155 bp) 
on capture nucleotide sequences of various length 
[0067] The protocols for capture nucleotide 

sequences immobilisation and silver staining detection are 
10 described in example 1. The capture nucleotide sequences 
and target DNA were obtained by amplification of CMV 
sequence by PCR using the following primers: 



Primer 


Sequence (5 ' -> 3 ' ) 


Length 
(bp) 


Location of 
Primer < d > 


Target a 


MIE-4 

MIE-Sc 

MIE-6C 


CCAAGCGGCCTCTGATAACCAAG 

CAGCACCATCCTCCTCTTCCTCTGG 

GGCGATGGCCCGTAGGTCATCCA 


437 
155 


2223-2245 
2657-2633 
2374-2352 



Capture nucleotide sequence* 3 


MIE-4C 


CCAAGCGGCCTCTGATAACCAAG 




2223-2245 


MIE-6a 


GCGGCGCTTCATTACACTGATAAC 


56 


2275-2252 


MIE-6b 


CGGCCCCCAGAATGTACTGGGCA 


105 


2324-2302 


MIE-6C 


GGCGATGGCCCGTAGGTCATCCA 


155 


2374-2352 


MIE-6d 


GTACAGGGGACTCTGGGGGTGAC 


255 


2474-2452 


MIE-6 e 


CTGCTCACTTTCTTCCTGATCACTG 


317 


2536-2560 


MIE-5C 


CAGCACCATCCTCCTCTTCCTCTGG 


437 


2657-2633 



( a ^The use of primer MIE-4 with respectively primer MIE-5c 
15 and MIE-6c resulted in target DNA of 437 and 155 bp 

(k^The use of primer MIE-4 with respectively primer MIE- 
6a, MIE-6b, MIE-6c, MIE-6d, MIE-6e and MIE-5c resulted 
in capture nucleotide sequences of 56, 105, 155, 255, 
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(c) mie-4 was aminated at 5' end. 

( d ) Expressed as nucleotide number counted sequentially 
within the MIE gene. 

5 

[0068] Plasmid pAT153-E (containing the exon 4 of 

the MIE gene of HCMV DNA AD169 strain) was amplified by 
PCR, in a 100 \xl volume containing 1,5 mM MgCl 2 , 10 mM Tris 

pH 8.4, 50 mM KC1 , 1 [J.M of each primer, 100 joM of each 
10 dNTP, 2.5 U of Taq DNA polymerase Gold and 10 ng of plasmid 
pAT153-E. Samples were first denatured at 94 °C for 10 min 
to activate the polymerase. Then 40 cycles of amplification 
were performed consisting of 30 sec at 94 °C, 30 sec at 65 
°C and 30 sec at 72 °C and a final extension step of 10 min 
15 at 72 °C. Water controls or 100 copies of plasmid DNA were 
used respectively as negative or positive controls of the 
amplification. PCR of target DNA also includes 100 /iM of 
biotin-16-dUTP. 

[0069] The hybridisation step was carried out at 

20 65 °C for 2 h in the presence of 100 f moles of biotinylated 
target DNA. 

Example 3: Influence of the capture nucleotide sequence 
length on the yield of fixation of long- target amplicons 
25 (437 bp) 

[0070] The experiment was conducted as described in 

example 2, with a target of 437 bp, obtained by 
amplification of the Plasmid pAT153-E by using the primers 
MIE-4 and MIE-5c. Hybridisation was conducted on the 
30 biochips of example 2. 
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Example 4: Detection of FemA sequences from different 
bacterial species on the same array 

[0071] The FemA genes corresponding to the different 

5 Staphylococcus species were amplified separately by 
multiplex PCR using the following primers: 



s. 


aureus 1 : 


5' 


CTTTTGCTGATCGTGATGACAAA 3' 


s. 


aureus 2 : 


5' 


TTTATTTAAAATATCACGCTCTTCG 3 ' 


s. 


epidermidis 1 : 


5' 


TCGCGGTCCAGTAATAGATTATA 3 ' 


s. 


epidermidis 2 : 


5' 


TGCATTTCCAGTTATTTCTCCC 3 ' 


s. 


haemolyticus 1 


: 5' 


CTATGGTATTAGCGGTAATTTTAG 3 ' 


s. 


haemolyticus 2 


: 5' 


TTTAATCTTTTTGAGTGTCTTATAC 3 ' 


s. 


saprophi ticus 1 


: 5' 


TAAAATGAAACAACTCGGTTATAAG 3' 


s. 


saprophiticus 2 


: 5' 


AAACTATCCATACCATTAAGTACG 3' 


s. 


homlnls 1 : 


5' 


CGAC CAGATAACAAAAAAG C AC AA 3' 


s. 


hominis 2 : 


5' 


GTAATTCGTTACCATGTTCTAA 3' 


[0072] The 


location 


of these primers and their 



specificity for the different Staphylococcus are presented 
in figure 2 . 

20 [0073] The multiplex PCR was performed in a final 

volume of 50 /il containing: 1.5 mM MgCl2/ 10 mM Tris pH 

8.4, 50 mM KCl, 0.8 jjM of each primer, 50 |iM of each dNTP, 
50 /xM of biotin-16-dUTP) , 1.5 U of Taq DNA polymerase 
Biotools, 7.5% DMSO, 5ng of plasmid containing FemA gene. 

25 Samples were first denatured at 94 °C for 3 min. Then 40 
cycles of amplification were performed consisting of 3 0 sec 
at 94 °C, 3 0 sec at 60 °C and 3 0 sec at 72 °C and a final 
extension step of 10 min at 72 °C. Water controls were used 
as negative controls of the amplification. The sizes of the 

3 0 amplicons obtained using these primers were 116 bp for 
S. saprophiticus, 128 bp for S. aureus, 118 bp for 
S. hominis, 162 pb for S. epidermidis and 160 bp for 
S. haemolyticus. 
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[0074] The sequences of the capture nucleotide 

sequences were the same as the corresponding amplicons and 
single strands . 

[0075] Protocols for capture nucleotide sequences 

5 immobilisation, hybridisation and silver staining detection 
are the one described in example 2 . 

Example 5: Detection of 16S rRNA from different bacteria by 
copying of a small portion of the sequence 

10 [0076] A specific sequence of the 16S rRNA sequence 

of three different bacteria (E.coli, Bacteroides distasonis 
and Bifidobacterium longrum) was copied using both a 
starting nucleotide sequence and a blocking nucleotide 
sequence which hybridises on the 16S rRNA and stops the 

15 reverse transcription. The copy of a specific sequence was 
done on 2|ag of total RNA extracted from different bacteria. 
The following sequences were used as starting and blocking 
nucleotide sequences : 

- for E.coli 

20 Starting nucleotide sequence 
5 1 CTCTGAAAACTTCCGTGGATG 3 * 
Blocking nucleotide sequence 
5 ' GTATCTAATCCTGTTTGCTCCCCACGCT 3 1 

- for B .distasonis 

25 Starting nucleotide sequence 
5 1 TACGATCCATAGAACCTTCATCCC3 ' 
Blocking nucleotide sequence 
5 1 CCTGCTTCATGCGGTATTAGTCCGAC 3 ' 

- for B . longum 

3 0 Starting nucleotide sequence 
5 1 CCACCGTTACACCGGGAA3 1 
Blocking nucleotide sequence 
5 ' CTCTCGCTTGCTCCCCGATA3 ' 
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[0077] The starting nucleotide sequence for E.coli, 

Bacteroides dlstasonis and Bifidohacterium longuw are 
located respectively at position 995-1016, 420-44 3,658-675 
on the 16S rRNA. 
5 [0078] The blocking nucleotide sequence for E.coli, 

Bacteroides distasonis and Bifidobacterium longum are 
located respectively at position 778-797, 172-198, 441-460 
on the 16S rRNA. The length of the synthetized cDNA are 
respectively of 240 bases, 273 bases and 236 bases. 

10 [0079] In a. nuclease free microtube, 0.5 fig of the 

starting nucleotide sequence and 2 /ig of the blocking 
nucleotide sequence were added to 2 £tg total RNA extracted 
from bacteria. Nuclease free water was added to a final 
volume of 15 /zl . The reverse transcription was conducted by 

15 adding the following components to the annealed nucleotide 
sequence /template : 5 /il of 5X AMV RT Buffer (250 mM Tris- 
HC1 pH 8.3, 250 mM KC1 , 50 mM MgCl2 50 mM DTT and 2.5 mM 
Spermidine) , 40 units of Rnasin ribonuclease inhibitor ( 
Promega, Madison, US) , 1 mM dATP, 1 mM dCTP, 1 mM dGTP, 

2 0 .0.65 mM dTTP , 0.35 mM biotin dUTP and 3 0 units of AMV RT ( 

Promega, Madison, US) . The final volume was adjusted to 25 
/il with nuclease free water. The reaction mixture was mixed 
gently and incubated for 60 min at 60 °C. 

[0080] The single strands cDNA obtained from the 

25 different bacteria were hybridised and detected on the 
array as in example 21. 

[0081] The results showed a specific hybridization 

of the cDNA for each of three bacteria on their respective 
capture nucleotide sequence. There was no cross-reactions 

3 0 since their values were at the level of the backgroung. 
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CLAIMS 

1. Detection and/or quantification method of 
a nucleotide sequence from at last 4 other homologous 

5 sequences, and being possibly present in a biological 
sample, comprising the steps of: 

- amplifying or copying at least a part of an original 
nucleotide sequence present in the biological sample 
into target nucleotide sequences, each target nucleotide 

10 sequence being located at a different location of the 

original nucleotide sequence; 

- putting into contact the obtained target nucleotide 
sequences with corresponding capture nucleotide 
sequences bounded to an insoluble solid support, 

15 - characterised in that said capture nucleotide sequences 
have a (single stranded) length comprised between about 
4 0 and about 400 bases, and in that said capture 
nucleotide sequences are bound to the insoluble support 
according to an array with a density of at least 5 

20 different capture nucleotide sequences / cm2 surface 

solid support, and 

detecting and possibly quantifying a signal resulting 
from the formation of double stranded nucleotide 
sequences resulting from their hybridisation by 
25 complementary base pairing, 

2. The method of. claim 1, wherein the length 
of the capture and target nucleotide sequences is comprised 
between about 5 0 and about 3 00 bases, preferably between 
about 100 and about 200 bases. 

30 3. The method of claim 1 or 2 , wherein the 

length of the capture nucleotide sequences differs less 
than 50% to the length of the target nucleotide sequences. 

4. The method of any of the preceding claims 
1 to 3 , wherein the different target nucleotide sequences 
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to be detected present an homology between each other 
higher than 30%, preferably higher than 60%, more 
preferably higher than 80%. 

5. The method of any of the preceding claims, 
5 wherein the insoluble solid support is selected from the 

group consisting of glasses, electronic devices, silicium 
supports, plastic supports, compact discs, filters, 
metallic supports or a mixture thereof. 

6. The method of any of the preceding claims, 
10 wherein the length of the target nucleotide sequences to be 

detected and/or be quantified is determined by the use of 
specific primers . for the retro- transcription or 
amplification of the 3 ' end of the original RNA or DNA 
nucleotide sequences to be detected and/or quantified in 
15 the biological sample. 

7 . The method of any of the preceding claims 
1 to 5, wherein the length of the target nucleotide 
sequences to be detected and/or quantified is determined by 
the use of specific, primer and blocking nucleotide sequence 

20 for the retro- transcript ion of the original RNA nucleotide 
sequences to be detected and/or quantified in the 
biological sample. 

8. The method of any of the preceding claims, 
wherein the original nucleotide sequences to be detected 

25 and/or quantified in the biological sample are rRNAs, 
preferably selected from the group consisting of 16S, 23S, 
18S and 28S rRNAs. 

9. The method of any of the preceding claims, 
wherein the original nucleotide sequences to be detected 

30 and/or quantified in the biological sample are FemA 
specific genetic sequences of Staphylococci species, 
preferably the FemA genes of the strain Staphylococcus 
aureus, epidermidis, saprophyt icus , hominis and/or 
haemolyticus . 
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10. A diagnostic and/or quantification kit or 
apparatus, which comprises an insoluble solid support upon 
which single stranded capture nucleotide sequences 
(allowing a specific hybridisation with target nucleotide 
5 sequences to be detected and/or quantified) , are bounded, 
preferably by a covalent link, said single stranded capture 
nucleotide sequences being disposed upon the surface of the 
solid support according to an array with a density of at 
least 5 different bound single stranded capture nucleotide 
10 sequences / cm 2 surface of the solid support and in that 
said single stranded capture nucleotide sequences have a 
length comprised between about 40 and about 400 bases. 
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A) Copy a specific portion of 1 6S rRNA 

— ■ 1 6S rRNA 

5' >3' 5 » ^.3. 

starting probe blocking probe 

1 st strand cDNA synthesis 

5' 3' cDNA 

B) Copy of the 1 6S rRNA extremity 

— 1 6S rRNA 

starting probe 5 1 > 3 1 

1 

1 st strand cDNA synthesis 

5* 3' cDNA 

Hybridization on array 

Fig. 3 



